A GridFTP Transport Driver for Globus XIO
Abstract
GridFTP is a high-performance, reliable data transfer protocol optimized for high-bandwidth wide-area networks. Based on the Internet FTP protocol, it defines extensions for high-performance operation and security. The Globus implementation of GridFTP provides a modular and extensible data transfer system architecture suitable for wide-area and high-performance environments. GridFTP is the de facto standard in projects requiring secure, robust, high-speed bulk data transport. For example, the high energy physics community is basing its entire tiered data movement infrastructure for the Large Hadron Collider Computing Grid on GridFTP; the Laser Interferometer Gravitational Wave Observatory routinely uses GridFTP to move 1 TB a day during production runs; and GridFTP is the recommended data transfer mechanism for maximizing data transfer rates on the TeraGrid. Commonly used GridFTP clients include globus-url-copy, uberftp, and the Globus Reliable File Transfer service. In this paper, we present a Globus XIO-based GridFTP client that provides a POSIX-like (open/close/read/write) interface to users. Such a client greatly eases the addition of GridFTP support to third-party programs, such as SRB and MPICH-G2. Further, this client gives applications a simpler, familiar interface for efficiently accessing remote files. We compare the performance of this client with that of globus-url-copy on multiple endpoints in the TeraGrid infrastructure. We perform both memory-to-memory and disk-to-disk transfers and show that the performance of this POSIX-like client is comparable to that of globus-url-copy. We also show that our GridFTP client significantly outperforms the GPFS WAN on the TeraGrid.
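To make the idea of a POSIX-like (open/close/read/write) interface over a swappable transport concrete, the following is a minimal sketch in Python. It is not the Globus XIO API and not the authors' client: the names `Driver`, `InMemoryDriver`, `Handle`, and the `mem://` URL are hypothetical, and the "transport" is a toy in-memory store standing in for a remote GridFTP server.

```python
import io
from typing import Protocol


class Driver(Protocol):
    """Hypothetical transport-driver interface (NOT the Globus XIO API)."""

    def open(self, url: str) -> None: ...
    def read(self, nbytes: int) -> bytes: ...
    def write(self, data: bytes) -> int: ...
    def close(self) -> None: ...


class InMemoryDriver:
    """Toy stand-in for a remote transport: one byte buffer per URL."""

    _store = {}  # shared by all instances, playing the role of the "server"

    def __init__(self):
        self._url = None
        self._buf = None

    def open(self, url):
        self._url = url
        # Start at offset 0 of whatever was stored earlier (empty if new).
        self._buf = io.BytesIO(self._store.get(url, b""))

    def read(self, nbytes):
        return self._buf.read(nbytes)

    def write(self, data):
        return self._buf.write(data)

    def close(self):
        # Persist the buffer so a later open() on the same URL sees it.
        self._store[self._url] = self._buf.getvalue()
        self._buf = None


class Handle:
    """POSIX-like facade: callers only see open/read/write/close."""

    def __init__(self, driver):
        self._driver = driver
        self._is_open = False

    def _require_open(self):
        if not self._is_open:
            raise ValueError("handle is not open")

    def open(self, url):
        self._driver.open(url)
        self._is_open = True

    def read(self, nbytes):
        self._require_open()
        return self._driver.read(nbytes)

    def write(self, data):
        self._require_open()
        return self._driver.write(data)

    def close(self):
        self._require_open()
        self._driver.close()
        self._is_open = False


if __name__ == "__main__":
    writer = Handle(InMemoryDriver())
    writer.open("mem://example/file")  # hypothetical URL scheme
    writer.write(b"hello grid")
    writer.close()

    reader = Handle(InMemoryDriver())
    reader.open("mem://example/file")
    print(reader.read(5))
    reader.close()
```

In this sketch, application code depends only on `Handle`, so replacing `InMemoryDriver` with a driver that speaks a real protocol would not change the calling code. That is the property the abstract credits with easing integration into third-party programs.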
Similar resources
Improving GridFTP Performance Using The Phoebus Session Layer
Phoebus is an infrastructure for improving end-to-end throughput in high-bandwidth, long-distance networks by using a “session layer” protocol and “gateways” in the network. Phoebus has the ability to dynamically allocate network resources and to use segment-specific transport protocols between gateways, as well as to apply other performance-improving techniques on behalf of the user. One of the...
Capability analyses of mdtmFTP
A decade ago it was theorized that if enough parallel processing power was thrown at an I/O problem, the bottleneck would shift towards the NIC’s link speed [1]. Today, however, NUMA architectures change this paradigm: it also matters on which cores these threads are located and how they are managed [2]. Transport protocols are also critical in high-latency networks, TCP’s congestion control alg...
RXIO: Design and implementation of high performance RDMA-capable GridFTP
For its low latency, high bandwidth, and low CPU utilization, Remote Direct Memory Access (RDMA) has established itself as an effective data movement technology in many networking environments. However, the transport protocols of grid run-time systems, such as GridFTP in Globus, are not yet capable of utilizing RDMA. In this study, we examine the architecture of GridFTP for the feasibility of e...
Extending Globus to support Multicast Transmission
The Globus Toolkit is an open source software toolkit that has become the de-facto standard for middleware used to build Grid services. Globus XIO is an eXtensible Input/Output library for Globus. It provides a POSIX-like API to swappable I/O implementations – essentially “I/O plugins” for Globus. TCP-XM is a modified TCP engine that supports multicast transmission. We have taken this protocol ...
Globus GridFTP: What’s New in 2007 (Invited Paper)
GridFTP is a high-performance, secure, reliable data transfer protocol optimized for high-bandwidth wide-area networks. It is based on the Internet FTP protocol, and it defines extensions for high-performance operation and security. The Globus implementation of GridFTP provides a software suite optimized for the gamut of data access issues—from bulk file transfer to the details of getting data ...